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Abstract — Least mean square-partial parallel interference can- 
celation (LMS-PPIC) is a partial interference cancelation using 
adaptive multistage structure in which the normalized least mean 
square (NLMS) adaptive algorithm is engaged to obtain the 
cancelation weights. The performance of the NLMS algorithm 
is mostly dependent to its step-size. A fixed and non-optimized 
step-size causes the propagation of error from one stage to 
the next one. When all user channels are balanced, the unit 
magnitude is the principal property of the cancelation weight 
elements. Based on this fact and using a set of NLMS algorithms 
with different step-sizes, the parallel LMS-PPIC (PLMS-PPIC) 
method is proposed. In each iteration of the algorithm, the 
parameter estimate of the NLMS algorithm is chosen to match 
the elements' magnitudes of the cancelation weight estimate with 
unity. Simulation results are given to compare the performance 
of our method with the LMS-PPIC algorithm in three cases: 
balanced channel, unbalanced channel and time varying channel. 



I. Introduction 

Multiuser detectors for code division multiple access 
(CDMA) receivers are effective techniques to eliminate the 
multiple access interference (MAI). In CDMA systems, all 
users receive the whole transmitted signals concurrently that 
are recognized by their specific pseudo noise (PN) sequences. 
In such a system, there exists a limit for the number of users 
that are able to simultaneously communicate. This limitation 
is because of the MAI generated by other users (see e.g. [1], 
[2]). High quality detectors improve the capacity of these 
systems [1], [6]. However their computational complexities 
grow exponentially with increasing the number of users and 
the length of the transmitted sequence [7]. 

Multiple stage subtractive interference cancelation is a sub- 
optimal solution with reduced computational complexity. In 
this method and before making data decisions, the estimated 
interference from other users are removed from the specific 
user's received signal. The cancelation can be carried out either 
in a serial way (successively) (see e.g. [8], [9]) or in a parallel 
manner (see e.g. [2], [3], [10]). The parallel interference 
cancelation (PIC) is a low computational complex method 
that causes less decision delay compared to the successive 
detection and is much simpler in implementation. 

Usually at the first stage of interference cancelation in a 
multiple stage system, the interfering data for each user which 
is made by other users is unknown. PIC is implemented 



to estimate this data stage by stage. In fact when MAI is 
estimated for each user, the bit decision at the (s — 1)'^ 
stage of cancelation are used for bit detection at the s'^ stage. 
Apparently, the more accurate the estimates are, the better 
performance of the detector is. However, in the conventional 
multistage PIC [3], a wrong decision in one stage can increase 
the interference. Based on minimizing the mean square error 
between the received signal and its estimate from the previous 
stage, G. Xue et al. proposed the least mean square-partial 
parallel interference cancelation (LMS-PPIC) method [10], 
[11]. In LMS-PPIC, a weighted value of MAI of other users 
is subtracted before making the decision of a specific user 
The least mean square (LMS) optimization and the normalized 
least mean square (NLMS) algorithm [13] shape the structure 
of the LMS-PPIC method of the weight estimation of each 
cancelation stage. However, the performance of the NLMS 
algorithm is mostly dependent on its step-size. Although a 
large step-size results in a faster convergence rate, but it causes 
a large maladjustment. On the other hand, with a very small 
step-size, the algorithm almost keeps its initial values and can 
not estimate the true cancelation weights. In the LMS-PPIC 
method, both of these cases cause propagation of error from 
one stage to another In LMS-PPIC, the m*'' element of the 
weight vector in each stage is the true transmitted binary value 
of the m"^ user divided by its hard estimate value from the 
previous stage. Hence the magnitude of all weight elements 
in all stages are equal to unity. This is a valuable information 
that can be used to improve the performance of the LMS- 
PPIC method. In this paper, we propose parallel LMS-PPIC 
(PLMS-PPIC) method by using a set of NLMS algorithms with 
different step-sizes. The step-size of each algorithm is chosen 
from a sharp range [14] that guarantees stable operation. 
While in this paper we assume coherent transmission, the non- 
coherent scenario is investigated in [5]. 

The rest of this paper is organized as follows: In section HIl 
the LMS-PPIC [10] is reviewed. The LMS-PPIC algorithm 
is an important example of multistage parallel interference 
cancelation methods. In section|IIIl the PLMS-PPIC method is 
explained. In section |IV] some simulation examples are given 
to compare the results of PLMS-PPIC with those of LMS- 
PPIC. Finally, the paper is concluded in section [V] 



II. Multistage Parallel Interference 
Cancelation: LMS-PPIC Method 

We assume M users synchronously send their symbols 
ai, a2j • • • 7 ctM via a base-band CDMA transmission system 
where a„i G {^Ij !}■ The m*'* user has its own code Pm{-) 
of length N, where Pm{n) e {—1,1}, for all n. It means 
that for each symbol N bits are transmitted by each user and 
the processing gain is equal to N. At the receiver we assume 
that perfect power control scheme is applied. Without loss of 
generality, we also assume that the power gains of all channels 
are equal to unity and users' channels do not change during 
each symbol transmission (it can change from one symbol 
transmission to the next one) and the channel phase (j)m of 
m*^ user is known for all m = 1, 2, • • • , Af (see [5] for non- 
coherent transmission). We define 

cM = e^^'-pmin). (1) 
According to the above assumptions, the received signal is 

M 

r{n) = amCjn{n) +v{n), n = 1, 2, • • • , iV, (2) 

m— 1 

where v{n) is additive white Gaussian noise with zero mean 
and variance a^. In order to make a new variable set 
af , , • • • , aj^j for the current stage s, multistage parallel 
interference cancelation method uses a^~^, aj^^, • • • , a^J^ 
(the bit estimate outputs of the previous stage s— 1) to estimate 
the related MAI of each user, to subtract it from the received 
signal r{n) and to make a new decision on each user variable 
individually. The output of the last stage is considered as 
the final estimate of the transmitted bits. In the following 
we explain the structure of the LMS-PIC method. Assume 
am~^^ e { — 1, 1} is a given estimate of am from stage s—l. 
Let us define 

am 



From (|2]| and Q we have 



M 



m— 1 



(4) 



Define 



W = [wlw^,---,<iV , (5a) 

where T stands for transposition. From equations (HI, ( |5al ) and 
dSbl i. we have 



r{n) = W X'{n) + v{n). 



(6) 



Given the observations {r{n), {n)}^^i, an adaptive algo- 
rithm can be used to compute 



which is an estimate of after N iterations. Then af, 
estimate of am at stage s, is given by 



N 



the 



(8) 



a™ = sign I^Re ^Yl^qm{n)Cm{n) 
where (.)* stands for complex conjugation and 

M 

qUn)=r{n)~ ^ w;^ (7V)a^r''c„' (n). (9) 

m' — l,m' T^m 

The inputs of the first stage {am}m=i (needed for computing 
X^{n)) is given by the conventional bit detection 

=sign |^Re|^^r(n)c;,(n)|^ , m = 1,2,---,M. 

(10) 

Given the available information {r{n), X''{n)}^^i and using 
equation there are a variety of choices for parameter 
estimation. In LMS-PPIC, the NLMS algorithm is used to 
compute W{N). TableUshows the full structure of the LMS- 
PPIC method. 

To improve the performance of the LMS-PPIC method, in 
the next section we propose a modified version of it. In our 
method a set of individual NLMS algorithms with different 
step-sizes are used. 

III. Multistage Parallel Interference 
Cancelation: PLMS-PPIC Method 

The NLMS (with fixed step-size) converges only in the 
mean sense. In the literature, ^ G (0, 2) guarantees the mean 
convergence of the NLMS algorithm [13], [15]. Based on 
Cramer-Rao bound, a sharper range was given in [14] as 
follows 



A* e * = 0, 1 - 



M -1 
M 



(11) 



where M is the length of the parameter under estimate. Here 
M is the number of users or equivalently the system load. As 
equation ( fTTT i shows, the range of the step-size is a decreasing 
function of the system load. It means that as the number of 
users increases, the step-size must be decreased and vice versa. 
In the proposed PLMS-PPIC method, ^I* has a critical role. 
From we have 



= 1 m = l,2,---,M, 



which is equivalent to 



M 

E 

m— 1 



IKJ-i| = o. 



(12) 



(13) 



To improve the performance of the NLMS algorithm, at time 
iteration n, we can determine the step size /i(n) from in 

M 

such a way that ^ ||K7^(n)| — 1| is minimized, i.e. 

m— 1 



W^{N) = [wliN),w^,{N), - ■ ■ ,wIj{N)]' 



(7) 



arg mm 




Wmin)\ 



(14) 



where w^{n), the m*'' element of ^{n), is given by (see 
Table IB 

The complexity to determine ^{n) from (fT4] i is high, especially 
for large values of M. Instead we propose the following 
method. 

We divide into L subintervals and consider L in- 
dividual step-sizes 6 = {/ii, /i2, • ■ • , Ml}, where = 

— , /i2 = 2/ii,---, and /i^ = L/ii. In each stage, L 
individual NLMS algorithms are executed {^i is the step- 
size of the algorithm). In stage s and at iteration n, if 
W^{n) = fc, ■ • • ,w\,j i.]^ , the parameter estimate of the 
k^^ algorithm minimized our criteria, i.e. 

W^{n) = arg m^in | ^ - 1|| , (16) 

where VFf (n) = W%n - I) + ^;^I^e(n), Z = 
l,2,---,fc,.-.,L- l,i and - {M^f (n), • • • , iy£(n)}, 
then it is considered as the parameter estimate at time iteration 
n, i.e. Win) = W^{n) and all other algorithms replace their 
weight estimates by W^{n). Table HIl shows the details of the 
PLMS-PPIC method. As Table shows, in stage s and at 
time iteration N where ^{N) is computed, the PLMS-PPIC 
method computes from equation (O. This is similar to the 
LMS-PPIC method. Here the PLMS-PPIC and the LMS-PPIC 
methods are compared with each other. 

• Computing fiiZ{n) = fxi y^sffip , L times more than 

M 

LMS-PPIC, and computing ^ - 1| in each 

m— 1 

iteration of each stage of PLMS-PPIC, is the difference 
between it and the LMS-PPIC method. 

• Because the step-sizes of all individual NLMS algorithms 
of the proposed method are given from a stable operation 
range, all of them converge fast or slowly. Hence the 
PLMS-PPIC is a stable method. 

• As we expected and our simulations show, choosing 
the step-size as a decreasing function of system loads 
(based on relation (fTTT l) improves the performance of 
both NLMS algorithm in LMS-PPIC and parallel NLMS 
algorithms in PLMS-PPIC methods in such a way that 
there is no need for the third stage, i.e. both the LMS- 
PPIC and PLMS-PPIC methods get the optimum weights 
in the second stage. However only when the channel is 
time varying, the third stage is needed, e.g. [3] 

• Increasing the number of parallel NLMS algorithms L 
in PLMS-PPIC method increases the complexity, while 
it improves the performance as well. 

• As our simulations show, the LMS-PPIC method is 
more sensitive to the Channel loss, near-far problem or 
unbalanced channel gain compared to the PLMS-PPIC. 

In the following section, some examples are given to illustrate 
the effectiveness of our proposed methods. 



IV. Simulations 

In this section we have considered some simulation ex- 
amples. Examples [T][3] compare the conventional, the LMS- 
PPIC and the PLMS-PPIC methods in three cases: balanced 
channels, unbalanced channels and time varying channels. 
Example □ is given to compare LMS-PPIC and PLMS-PPIC 
in the case of balanced channels. 

Example 1: Balanced channels: Consider the system 
model dU in which M users, each having their own codes of 
length N, send their own bits synchronously to the receiver 
and through their channels. The signal to noise ratio (SNR) 
is OdB. In this example we assume that there is no power- 
unbalanced or channel loss. The step-size of the NLMS algo- 
rithm in LMS-PPIC method is ^ = 0.1(1- y/^^) and the set 
of step-sizes of the parallel NLMS algorithms in P LMS- PPIC 
method is 6 = {0.01, 0.05, 0.1, 0.2, • • • , 1}(1 - \J i.e. 

/.I = O.Ol(l-y^), • • • , - 0.2(l-y^), • • • ,Mi2 = 

(1 — \J ^^M^ )- Figure [T] shows the average bit error rate (BER) 
over all users versus M, using two stages when = 64 
and N = 256. As it is shown, while there is no remarkable 
performance difference between all three methods for N = 64, 
the PLMS-PPIC outperforms the conventional and the LMS- 
PPIC methods for N = 256. Simulations also show that there 
is no remarkable difference between results in two stage and 
three stage scenarios. 

Although LMS-PPIC and PLMS-PPIC are structured based 
on the assumption of no near-far problem, these methods 
(especially the second one) have remarkable performance in 
the cases of unbalanced and/or time varying channels. These 
facts are shown in the two upcoming examples. 

Example 2: Unbalanced channels: Consider example [T] 
with power unbalance and/or channel loss in transmission 
system, i.e. the true model at stage s is 

M 

r{n) = 0mw'^ai^-^'> c„^{n) + v{n), (17) 

m— 1 

where < ^ 1 for all 1 m M. Both the LMS-PPIC 
and the PLMS-PPIC methods assume the model and their 
estimations are based on observations {r{n), X^{n)}, instead 
of {r{n),GX''{n)}, where the channel gain matrix is G = 
diag(/3i, /32, • • • , /3m)- In this case we repeat example [T] We 
randomly get each element of G from (0,0.3]. Results are 
given in Figure |2] As it is shown, in all cases the PLMS-PPIC 
method outperforms both the conventional and the LMS-PPIC 
methods. 

Example 3: Time varying channels: Consider example [T] 
with time varying Rayleigh fading channels. In this case 
we assume the maximum Doppler shift of 40HZ, the 
three-tap frequency-selective channel with delay vector of 
{2 X 10-^2.5 X 10-^3 X lO^'^jsec and gain vector of 
{—5, —3, — lOjdB. Results are given in Figure[3] As it is seen 
while the PLMS-PPIC outperforms the conventional and the 
LMS-PPIC methods when the number of users is less than 



30, all three methods have the same performance when the 
number of users is greater than 30. 

V. Conclusion 

In this paper, parallel interference cancelation using adaptive 
multistage structure and employing a set of NLMS algorithms 
with different step-sizes is proposed. According to the pro- 
posed method, in each iteration the parameter estimate is 
chosen in a way that its corresponding algorithm has the best 
compatibility with the true parameter. Because the step-sizes 
of all algorithms are chosen from a stable range, the total 
system is therefore stable. Simulation results show that the new 
method has a remarkable performance for different scenarios 
including Rayleigh fading channels even if the channel is 
unbalanced. 
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Fig. 1. The BER of the conventional, the LMS-PPIC, and the PLMS-PPIC 
methods versus the system load in balanced chaimel, using two stages for 
AT = 64 and AT = 256. 
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Fig. 2. The BER of the conventional, the LMS-PPIC, and the PLMS-PPIC 
methods versus the system load in unbalanced channel, using two stages for 
AT = 64 and AT = 256. 
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Fig. 3. The BER of the conventional, the LMS-PPIC, and the PLMS-PPIC 
methods versus the system load in time varying Rayleigh fading chaimel using 
two stages for AT = 64 and AT = 256. 



TABLE I 

The procedure of the LMS-PPIC method 



> 



for m = 1, 2, • • • , M 



a%, = sign real '•(w)Cm(w) 



VF-(o) = K(o),---,u,i,(o)]^' = [o,-..,o]^ 



for s = 1, 2, • • • , S 



•a 
o 

60 



for n = 1,2, •••,Af 



for m = 1,2, •••,M 



X''{n) = [a{ "^'ci(n), Qj'' ^^C2(n), • • • , ^^CM(n)]^ 

e(n) = r(n) - w'^ {n - l)X'{n) 

Z{n) = J^l^e{n) 

^{n) =W{n-l)+ nZ{n) 



qUn) = r{n)- ^ w^, {N)a^y^ c^, {n) 

m =l,m 

Qm = sign I real | q?n{n)c'^{n)^^ 



TABLE II 

The procedure of the PLMS-PPIC method 



for m = 1, 2, • • • , M 



> 



am = sign <^ real < J2 r{n)c, 



w^jo) = [wm, - ■ ■ ,wit{o)v = [0, - ■ ■ ,or 



for s = 1, 2, • 



o 

60 



I/) 

z; 



for n = 1, 2, • • • , iV 



for m = 1,2, •••,M 



X'{n) = [a'r'>ci{n),a'r'>C2{n), ■ ■ ■ ,a'Z-'>CM{n)r 
e(n) = r(n) - w'^(n - l)X'{n) 

min = oo, / = 1 



for A; = 1,2, •••,L 



M 

if E ||«;^,fc(n)|-l| <min: 

M 

niin= - 1| 

I = k 



qUn) = r(n)- ^ (iV)a^r^'c^' (n) 

m — l.m T^m 

am = sign I real | E^ gm(«)cm(«) 



